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Abstract 

In this paper we study the spectral properties of Markov-operator on 
L 2 -spaces. Lawler and Sokal (Trans. Amer. Math. Soc, 1988, 309, 
pp. 557-580) used isoperimetric constants for discrete and continuous 
time Markov chains to obtain a spectral gap at 1. For time discrete 
Markov chains this does not exclude periodic behavior. We define a new 
constant measuring the distance from periodicity and give necessary and 
sufficient conditions for the existence of a global spectral gap in terms of 
this constant. 



1 Introduction 

Several techniques had been developed for studying the stochastic behavior of 
Markov processes. For example, one may analyze the spectral properties of the 
associated Markov operator. A well-known result is the Frobenius-Perron theo- 
rem, which states that for an irreducible, aperiodic and finite state space Markov 
chain £i, £a> • • • we get that 1 is an eigenvalue (according to a left eigenvector n, 
7T is the uniquely determined invariant measure) of P (P the transition matrix 
of the chain) and the absolute values of all other eigenvalues A are smaller than 
one. Moreover, the speed of convergence of the transition probabilities to it is 
geometric and can be estimated in terms of | A2 1 , where A2 is the eigenvalue with 
second largest modulus. The quantity r := 1 — |A 2 is often called the spectral 
gap of the chain. Since for Markov chains with large state space it is often 
impossible to compute A2, one is interested in estimating the gap. In the last 
20 years several papers concerning spectral gaps for Markov chains has been 
published (e.g. [3], 0], [5], [6], 0, 0, [9], [11], [12], [13]). There are mainly 
two reasons for this: On one hand, in the early 1990's Markov Chain Monte 
Carlo (MCMC) methods were developed and statements concerning the speed 
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of convergence needed information about the size of the spectral gap. On the 
other hand, at that time the mathematicians working in this area started to use 
the technique of Dirichlet Forms which proved to be very successful to tackle 
this problem. 

First, Cheeger P] introduced an isoperimetric constant k of a compact Rie- 
mannian manifold M in order to give a lower bound for the smallest strictly 
positive eigenvalue of the Laplacian on M. Since a discrete version of the Lapla- 
cian can be associated with time discrete Markov chains, several authors inves- 
tigated spectral properties of Markov operators using Cheeger's idea (e.g. [3], 
@]j Ej, [IH]) [H], [IS])- A fundamental paper comprising the results of research 
in this area until 1988 is provided by Lawler and Sokal |13j . 
Lawler and Sokal [13] established a simple necessary and sufficient condition 
for reversible, continuous time Markov chains that ensures the existence of a 
spectral gap in terms of isoperimetric constant k. Their estimates concerning 
the size of the gap were partially improved by Chen [3: in 2000. The condition 
mentioned before is still necessary for discrete time Markov chain, but it is not 
sufficient. As far as we know, in this setting no simple condition equivalent to 
the spectral gap property has appeared in the literature. We introduce a con- 
stant K which measures the chain's distance from periodicity. This constant is 
used to obtain a condition that ensures the spectral gap property for reversible 
discrete time Markov chain on a general state space. Moreover, we establish a 
lower bound for the size of the gap in terms of k and K . For this, we use a 
geometric approach which can be generalized to the non-reversible case. 

2 A spectral gap theorem for reversible Markov 
chains 

In this section we introduce the basic notations and remind some well known 
facts that will be needed in the sequel. Throughout this paper we consider 
a positive recurrent Markov chain £1,^2, • • ■ with arbitrary state space (ft, J-), 
transition kernel p{x, dy) and uniquely determined invariant probability measure 
7T. The stochastic kernel p(x, dy) induces a linear operator P on the Hilbert 
space L 2 (ir) by 

Pf(x) := / f(y)p(x,dy). (1) 

Using Jensen's inequality, we see that ||P||2 < 1 and since 1 is an eigenfunction 
of P, we have ||P||2 = 1< Sometimes we will assume that the Markov chain is 
reversible, i.e. 

ir(dx)p(x, dy) = n(dy)p(y, dx). (2) 

This implies that P is self adjoint and the spectrum of the operator P is real 
valued, i.e. 

ff (P)c[-i,l]. 
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The main question of interest is now whether an r > exists with 

lim \\P n f\\l <l-r. (3) 

n — ► oc 

for all / G L2 l(?r) . = {/ e i2(7r) . J o/ ( x)7r(dx ) = 0,/ n /(x) 2 7r(dx) - 1}. The 
supremum over all r > satisfying is called the spectral gap of P. 
In order to estimate r, let us define 

k := inf k{A), (4) 
as introduced in [13] , where 

The quantity measures the probability flow out of the set A to its comple- 
ment. 



Let us introduce a family k n of isoperimetric constants associated with the 
Markov chain £i, £n+i > &n+i i . . .. 

k n := inf k n (A), k n (A) := / p n (x, A c )n(dx), neN. (6) 

From the definition it follows that k = k±. 

Now let us assume that P is self adjoint. We say that the operator P has a 
spectral gap of size r% at 1 if and only if 

a(P) c [-1,1-n], ri>0. (7) 

Accordingly, we say that P has a spectral gap at —1 if and only if a r_i > 
exists such that 

«r(P)c[-l + r_i,l] (8) 

The size of the gap is given by the supremum over all n such that (J7JI 

({SJ respectively) is fulfilled. Lawler and Sokal [I3j proved the following bounds 
for the spectral gap at 1 in terms of k: 

Theorem 1 (Lawler, Sokal) For a reversible, positive recurrent Markov chain 
with uniquely determined stationary distribution tt we know that 

k>n> Jfc 2 , (9) 

o 

where K is a constant larger than one (see [13)). 

For reversible finite state space Markov chain a similar result is provided by 
Jcrrum and Sinclair [12 in terms of conductance (for the definition of conduc- 
tance see e.g. [1]), which is closely related to the definition of the isoperimetric 
constant. 
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In order to have a spectral gap r in the sense of © we should also know the 
spectral gap r_i at —1, since r — min(r_i,ri). The value of r_i measures the 
periodic behavior of the chain in a certain sense. For this reason let us consider 
the following 



Example 1 Let il := {1,2,3,4}. Assume that 

p(i, i + 1( mod A)) =p(i,i— l(modA)) — |, i G fl. Then the invariant measure 
is given by tt — (\, \ , \ , \)- Let us consider the set A = {1,3} and define 
f:=l A -l A c 

One can show that k > and 

Pf = ~f, 

so —1 is an eigenvalue of P. Hence this chain does not have a spectral gap in 
the sense of ^j). 

The following observation is crucial. In the example above we have 

k(A) = 2. 
Let us define a new constant, namely 

K := sup k(A). 

In the way k measures the distance from ergodicity, 2 — K measures the dis- 
tance from periodicity. Since periodicity of order larger than 2 is excluded by 
the reversibility assumption, one may hope that K < 2 is actually a sufficient 
condition. 



We now state the main result of the paper, namely a condition that ensures 
the existence of a spectral gap for general state space Markov chains in terms 
of fc, k2 and K. 

Theorem 2 Let £1,^27 ■ ■ ■ be a reversible, time homogeneous Markov chain with 
stationary distribution tt and transition kernel p(-, •). Then the following three 
conditions are equivalent: 

1. P has a L 2 -spectral gap. 

2. 

< k < K < 2. (10) 
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3. 

k 2 > 0. 

For the spectral gap we obtain the following estimate: 



a(P)c 

where k 2 can be estimated from below by 



1 B ,mi n( Jl kl,l fc 2 ) 

8 2 V V 8 2 ' 8 ; 



(11) 



(12) 



> 



S ,-(e 1 e 2 (l-S)-5), 
'(2-e)(l- ei )(l-e a )(l-<S) 1 



sup 



(l-e)K 



1 



1 



(13) 



It is known that the first inequality is a necessary condition for spectral gap 
property ([H], [I3j)- For continuous time Markov chains, the condition k > 
(k defined by the transition rates g(-, •) instead of the transition probabilities 
p(-, ■)) is indeed sufficient, since periodicity is a phenomenon occurring only for 
discrete time Markov chains. As mentioned above, the third inequality in pop 
excludes periodicity of the chain. 



3 Proof of Theorem 2 

In order to prove the theorem above, one has to establish several lemmas. In 
the following we use the setting above. If reversibility is required, then it will 
be mentioned explicitly. 

Lemma 1 For all A G T holds: 

k{A) = k{A c ). 

Proof: We see that for the proof we only need the stationarity of the starting 
distribution 7r of the Markov chain. 



p(x,A c )Tr(dx) = / p(x, A c )n(dx) - / p(x, A c )it(dx) 
i Jn JA" 

= ir{A c ) - (n(A c ) - [ p(x, A)n(dx)) 
p(x, A)ir(dx). 

a 

The next lemma provides a precise upper bound for k{A): 
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Lemma 2 For all A G T we have 

< k{A) < 2. (14) 

Proof: The first inequality is trivial. In order to show the second one, let us 
consider K := sup Ae:F k(A). Then for every e > there exists A 6 T such that 
k(A) > K — e. For such a set A it follows 

p(x, A c )ir(dx) >{K- e)Tr{A)Ti{A c ). (15) 

Since p(x,A c ) only takes values between and 1, an a with < a < 1 exists 
such that 



p{x, A c )n{dx) = an(A). (16) 

L 

This together with (fT5|) yields 

a > (K - e)ir(A c ). (17) 

By Lemma [T] we also have 

p(x, A)n{dx) > (K - e)n(A)n(A c ). 
So there exists j3 £ [0, 1] such that 

p(x,A)Tr(dx) =(3ir(A c ). (18) 



As above this yields 

13 > (K - e)ir(A). (19) 

Adding $F7$ and (HU) we get 

2>a + (3 > {K - e)n(A) + (K - e)n(A c ) = K - e. (20) 
Since this is true for all e > 0, we have K < 2, which proves the lemma. 

□ 

We saw in Example 1 that K — 2 is possible and the bound was attained by 
some set A with tt(A) = |. The next lemma shows that k(A) > 2 — e is only 
possible for sets A £ T with ir(A) close to \. 

Lemma 3 Let K := sup^-p k(A) = 2. Then for any sequence of sets A n 6 T 
with the property that k(A n ) — > 2, we have that Tr(A n ) — > ^. 

Proof: According to (fTT))) and (|18p . we associate with A„ and the con- 
stants a n and /?„. From k(A n ) > 2 — e n , < e„ — * and (|2H)) with if = 2, 
a = a n , (3 = /3„, e = e„ it follows that a„ -> 1 and /3„ — * 1. Using (fTF)) and (fT9|l 
with if = 2 and keeping in mind the first inequality in (|20[) we can conclude 
that n(A n ) -> i. 
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□ 



We have the following simple relation between k and k n 
Lemma 4 



k = 



k n = Vn e N. 



Proof: 

7r(A)7r(A c )fc„(A) 



p"(x,A c )7r(dx) 



p(x,dy)p n \y,A c )i:(dx) 



I A J A 

„n-l/ 



J A" 

< (*Vi_i(A) + fc(A))7r(A)vr(A c ) < . . . < n fc(A)7r(A)7r(A c ). 

□ 

Proposition 1 Let P: io( 7r ) ~* ^oW ^ e ^ e se ?/ adjoint operator associated 
to the reversible Markov chain £i,£2, • • •■ Then we obtain 



a(P)c 



(21) 



Proof: Since P is self adjoint, P 2 is self adjoint and positive. Applying 
Theorem 1 to P 2 yields 

a(P 2 )c[0,l-|fcf]. 
Using the spectral mapping theorem ( |14j). we obtain 

a(P)c 



□ 



Corollary 1 The Markov chain £i , £2 , • • ■ has a spectral gap if and only if 

k 2 > 0. (22) 

Proof: The sufficiency of the condition follows immediately from Proposition 
[TJ Now let us assume that k 2 — 0. Applying Lemma H] to the Markov chain 
£i>£3>£5, •• •) we obtain that k 2n — for all n G N. Then applying the first 
inequality in ([9]) to the chain £i, £271+1, £471+1, ■ ■ ■ yields 



{1} C cr(P 2 ™) for all neff. 



This finalizes the proof. 



□ 



In order to prove Theorem 2, we have to establish the connection between 
k, K on one hand and k 2 on the other hand. 
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Lemma 5 Assume that K = 2. Then k%=Q. 
Proof: For each A n in T we obtain 



Tr{A n )Tr(A c n )k 2 (A n ) = / / n(dx)p{x,dy)p(y,Al) 

J A n J A n 

■n(dx)p(x,dy)p(y,A c n ) 



i A n J A" n 

< / n(dx)p(x,A n ) + I n(dx)p(x,A c n ) 



n(A n )- / TT(dx)p(x,A n ) + n(A c n ) 
J 

■n(dx)p(x,A c n ) 
1-2 / 7r(dx)p(x.A c n ). 



Now choose a sequence A n € T with k(A n ) — » 2. From Lemma [3] we know that 
tt(A„) -> |. This yields 



□ 



Remark 1 5mce we did not use i/ie reversibility assumption, Lemma[5\is true 
for arbitrary Markov chains. 

Now we come to the most difficult part of the proof of Theorem 2: 

Lemma 6 Let £i , £2 > • • • be a reversible, time homogeneous Markov chain with 
arbitrary state space (CI, J 7 , n). We get the following estimate for fc 2 : 

k 2 k 

k 2 > sup min — S, -(eie 2 (l - S) - S), 

(2-e)(l- ei )(l-e 2 )(l-5) 1 



[l-e)K 1-eJ 1-e, 

Proof: During the proof we use the relationship ir(dx)p(x, dy) = Tr(dy)p(y, dx) 
without mentioning this fact. For AGf we have 



k 2 (A) = / <dx)Ax,An 

1 



n(A)Tr(A c ) \J A 
1 



■jr(dx) / p(x, dy)p(y, A c ) + / Tr(dx) / p{x,dy)p(y,A c ) 



7T(A)ir(A c ) \J A 

inf —TT\ i i 

A€J:7r(4)<i TT(A)T:(A c ) J a 



■jr(dx) / p(x, dy)p(y, A c ) + / Tr(dx) / p(x, dy)p(y, A) 
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and therefore 

k 2 > 



„ ' m L x (A\(Ac\ I ^ I P(^dy)p(y,A c ). (23) 
Aer-.ir(A)<± tt(A)t:(A c ) J a J a 

Hence we can assume without loss of generality that ir(A) < i. Let us define 



Ah :={yeA:p(y,A c )>^}. 



Then we obtain 



/ n(dx) [ p(x,dy)p(y,A c ) > [ n(dx) [ p(x, dy)p{y, A c ) > \ f Tr(dx)p(x, Ah). 

J A J A J A JA k 4 J A 

(24) 

Let us continue by estimating J A ir(dx)p(x, Ak ). For this reason define 

C := A\ n A, 

4 

A e := {x e C :p(x,Ah) > e}. 

4 

We now consider three cases: 

First assume that there exists S A > 0, such that 

tt(C) > 6 A n(A). (25) 

Then it follows: 

= in\ (nc\ f P(x,Ak)ir(dx) + [ p(x, A c )ir(dx) 
7r(C)7r(C c ) J c * J c 

' ( [ p{x, Ak)ir(dx) + eir(C PI A°) + ^(C)j 



< 



- ^c^) h/ {x ' A i )n{dx) + 2e + l 



Choosing e = „ we get 



p(x, Ak)n(dx) > 



<C)n{C<) J Ak Fy ' ^ v A 
s 

and therefore 

J p(x,A±)Tr(dx) > ^tt(C)tt(C c ) > ^5ait(A)tt(A c ] 
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Together with and (23]) this yields 

k 2 (A) > ^5 A . (26) 

Let us dehne 

B ei := {x £ Ak : p(x, A c ) < 1 - ej. 

For the second case we assume that 7r(C) < 5 a tt(A) and an 62 > exists such 
that 

Jr(fl 6l ) > e 2 7r(Aj). 
With these assumptions it follows that 

k ^ = (A\(Ac\ i <dx)p 2 {x,A c )> ( J (Ac , I n(dx)p 2 (x,A c ) 
HA)tt{A c ) J a Tr{A)Tr{A c ) J Ak 

1 



7r(A)7r(A c ) \J A 



n(dx) / p(x, dy)p{y, A c ) + / n(dx) / p(x, dy)p(y, A c ) 



- T - TTTTT^T / n(dx)p(x, B ei ) = - 1 / 7r(dx)p(x, A* ). 

4 

Moreover, we have 
1. 

/ Tr(dx)p(x, A\ n A) = \ n(dx)p(x, B ei ) < n{A c k fli) < 8 A n(A). 
is ei 4 JA\ nA ' 4 

2. 

7r(dx)p(a;, A) > ei7r(S £1 ) > e^^Ak) > eie 2 (l - S a )tt(A). 

4 

Subtracting the first inequality from the second we obtain 

n(dx)p(x, Ah) > (e 2 ei(l - S A ) - S A )ir(A) 



and hence 

k 2 (A)>^(e 2 e 1 (l-S A )-S A ). (27) 

For the last case let us assume that there exists S A , t\ and e 2 with the same 5 A , 
ei and e 2 as before such that 

tt(C) < S a tt(A); 7r(B ei )<e 2 TT(A k ). 
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First, we observe that for small 61,62 and 5 A , the associated tt(A) is bounded 
away from \. This can be established in the following way: 

/ ir(dx)p(x,A c ) > / Tr(dx)p(x, A c ) > / n(dx)p(x, A c ) 

J A JA k JA k nBc 

4 4 1 

> (1 - ei)7r(i4* n B c e J > (1 - ei)(l - e 2 )7r(Aj) 

> (l-e 1 )(l-e 2 )(l-5 A )Tr(A). 

This yields 

H A) > (i-*i)(i-* 2 )(i-H 

Since k(A) < K for all A € J 7 by definition of if, we obtain 

.(A") > (28) 

This inequality will now be used in order to continue the estimation of k 2 {A). 
Define 

H e :={yeA c :p(y,A c )>e}. 



Then we have 



k 2 (A) > -—l—-f 7 T(dx)fp(x,dy)p{y 7 A c ) 

ir(dx) / p(x,dy)p(y,A c ) 



1 

> 



ir(A)ir{A c ) 
1 

n(A)7r(A c ) J A 



> £ TTTT77\ I <dx)p(x,H e ). (29) 



In order to obtain a suitable estimation of L, we consider the probability flow 
out of the set AU Hf: 



k - tt< A 1 i ffcvfff S / K(dx)p(x,H f _ 
tt(AU Hc)ir(H e ) J AuH c 



1 



/ ir(dx)p(x, H e ) + / ir(dx)p(x,H e 
Ja Jn^nA" 



ir(A U H^{H e ) \J A v " yi tf J H c nAC 

tt(AMAc) ^n^) 
~ ir(AUHc)ir(H e ) tt(A U H^)tt{HA 

1 / 7r(ii f c nA c ) \ , N 



n(H e ) V n(A) 
From this we obtain 
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Now we continue to estimate n(H e ) and ^^f^ - ■ It holds 

n(A) > [ Tr{dx)p{x, A) > (1 - e)7r(i7 e c n A c ). 
JH°nA c 

It follows that 

Trim n A c ) 1 
By adding 7r(F e ) to both sides of tt(H^ n A c ) < j^tt(A) we obtain 

and hence 
Using fl2S} we get 

(l-ei)(l-e 2 )(l-5 A )2-e 1 



7r(# e ) > 



X 1 -e 1 - e 

This and ([32]) inserted into (f31"j) yields 

" V 1-eJ 1-e V 7 

Again, inserting this into (f2"5|) , we obtain 

So we proved three different inequalities for ^(-A), which are listed below: 

1. 2 

fc 2 (A) > ^ for n(C) > 6 A n(A). 
16 

2. 

*a(i4)>£(eie a (l-<^)-*A) for tt(C) < S a tt(A), n{B ei ) > e 2 Tr(A h ). 
4 4 

3. 

'(2-e)(l-ei)(l-e a )(l-« A ) 1 



fc 2 (A) > elk 



(l-e)K 

for tt(C) < <5a7t(^), 7r(S £l ) < e 2 ir(A k ). 

(35) 
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As an immediate consequence we obtain 

~k 2 „ k 



k 2 > sup min 

i5,ei,e 2 ,eeR + 



(2-e)(l- ei )(l- e2 )(l-tf) 1 \ 6 



This finishes the proof of the lemma. 



(36) 



□ 



Lemma 7 Let £i,£2, ■ ■ ■ be a reversible Markov chain such that 
0<k = inf MA) < sup MA) := K < 2. 

Then it follows that 

k 2 > 0. 

Proof: Choose ei and £2 in a way such that eie 2 (l — 5) = 26 (e.g. ei = £2 = 

y/ ■ Now choose e and (5 sufficiently small such that the third term of is 

larger than zero. But then (|36[) is also bounded away from zero. So the lemma 
is proven. 

□ 

Proposition 2 For the reversible Markov chain £i,£2>-- - the following holds 
true: 

fc 2 >0 <^ 0<k<K < 2 (37) 
Proof: The proof follows immediately from Lemma [5] and Lemma [7] 

□ 

The proof of Theorem 2 follows from Corollary [1] and Proposition |2J 
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